Adjusting Occurrence Probabilities of Automatically-Generated Abbreviated Words in Spoken Dialogue Systems

Authors

  • Masaki Katsumaru
  • Kazunori Komatani
  • Tetsuya Ogata
  • Hiroshi G. Okuno
Abstract

Users often abbreviate long words when using spoken dialogue systems, which results in automatic speech recognition (ASR) errors. We define abbreviated words as sub-words of an original word and add them to the ASR dictionary. The first problem we face is that proper nouns cannot be correctly segmented by general morphological analyzers, although long and compound words need to be segmented in agglutinative languages such as Japanese. The second is that, as vocabulary size increases, adding many abbreviated words degrades the ASR accuracy. We have developed two methods, (1) to segment words by using conjunction probabilities between characters, and (2) to adjust occurrence probabilities of generated abbreviated words on the basis of the following two cues: phonological similarities between the abbreviated and original words and frequencies of abbreviated words in Web documents. Our method improves ASR accuracy by 34.9 points for utterances containing abbreviated words without degrading the accuracy for utterances containing original words.
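
The two methods named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not give the formulas, so every function name, the threshold, the linear weighting with `alpha`, and the use of plain string similarity in place of a true phonological similarity are assumptions made only for illustration.

```python
from collections import Counter
from difflib import SequenceMatcher


def conjunction_probs(words):
    """Estimate P(next char | char) from a word list (assumed estimator)."""
    pair_counts, char_counts = Counter(), Counter()
    for w in words:
        for a, b in zip(w, w[1:]):
            pair_counts[(a, b)] += 1
            char_counts[a] += 1
    return {pair: n / char_counts[pair[0]] for pair, n in pair_counts.items()}


def segment(word, probs, threshold=0.5):
    """Split `word` wherever adjacent characters have a conjunction
    probability below `threshold` (the threshold is a hypothetical value)."""
    if not word:
        return []
    segments, current = [], word[0]
    for a, b in zip(word, word[1:]):
        if probs.get((a, b), 0.0) < threshold:
            segments.append(current)
            current = b
        else:
            current += b
    segments.append(current)
    return segments


def abbreviation_candidates(segments):
    """Contiguous runs of segments, excluding the full word itself.
    Treating sub-words as contiguous runs is an assumption."""
    n = len(segments)
    return [
        "".join(segments[i:j])
        for i in range(n)
        for j in range(i + 1, n + 1)
        if j - i < n
    ]


def adjusted_weights(original, candidates, web_counts, alpha=0.5):
    """Score each candidate from two cues and normalize the scores to sum to 1.
    String similarity stands in for phonological similarity here."""
    total = sum(web_counts.get(c, 0) for c in candidates) or 1
    raw = {}
    for c in candidates:
        similarity = SequenceMatcher(None, c, original).ratio()
        frequency = web_counts.get(c, 0) / total
        raw[c] = alpha * similarity + (1 - alpha) * frequency
    norm = sum(raw.values()) or 1.0
    return {c: score / norm for c, score in raw.items()}


# Usage with toy, hand-made values (not data from the paper).
toy_probs = {("a", "b"): 0.9, ("b", "c"): 0.1, ("c", "d"): 0.8}
toy_segments = segment("abcd", toy_probs)
toy_candidates = abbreviation_candidates(toy_segments)
toy_weights = adjusted_weights("abcd", toy_candidates, {"ab": 30, "cd": 10})
```

In a real system the resulting weights would scale the occurrence probabilities of the abbreviated words in the ASR language model, so that unlikely abbreviations compete less with the original vocabulary.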

Similar resources

Expanding vocabulary for recognizing user's abbreviations of proper nouns without increasing ASR error rates in spoken dialogue systems

Users often abbreviate long words when using spoken dialogue systems, which results in automatic speech recognition (ASR) errors. We define abbreviated words as sub-words of the original word, and add them into an ASR dictionary. The first problem is that proper nouns cannot be correctly segmented by general morphological analyzers, although long and compounded words need to be segmented in agg...

Adaptive Spoken Dialogue Systems

Adaptive systems cover a broad range of interactive systems which adjust to new tasks, situations, users or expressions. These systems identify and classify relevant features to develop over time, adjusting their behaviour to different users and situations. The topic of this paper is adaptation in spoken dialogue systems based on features in the dialogue. These systems automatically extract dia...

Viability of a Simple Dialogue Act Scheme for a Tactical Questioning Dialogue System

User utterances in a spoken dialogue system for tactical questioning simulation were matched to a set of dialogue acts generated automatically from a representation of facts as 〈object, attribute, value〉 triples and actions as 〈character, action〉 pairs. The representation currently covers about 50% of user utterances, and we show that a few extensions can increase coverage to 80% or more. This ...

Automatically predicting dialogue structure using prosodic features

Spoken dialogue systems need to track dialogue structure in order to conduct sensible conversations. In previous work, we used only a shallow analysis of past dialogue in predicting the current dialogue act. Here we show that a hierarchical analysis of dialogue structure can significantly improve dialogue act recognition. Our approach is to integrate dialogue act recognition with speech recogni...

Towards Natural Clarification Questions in Dialogue Systems

Clarifications are often necessary for maintaining human-human as well as human-machine dialogue. However, clarification questions asked by Spoken Dialogue Systems (SDS) are very different from clarification questions asked in natural human interaction. While in human-human dialogues, speakers ask targeted questions using contextual information, SDS ask generic clarifications such as please rep...


Publication date: 2009